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Abstract 



A strong direct product theorem states that, in order to solve k instances of a problem, if we 
provide less than k times the resource required to compute one instance, then the probability 
of overall success is exponentially small in k. In this paper, we consider the model of two- 
way public-coin communication complexity and show a strong direct product theorem for all 
relations in terms of the smooth rectangle bound, introduced by Jain and Klauck [TJ] as a 
generic lower bound method in this model. Our result therefore uniformly implies a strong 
direct product theorem for all relations for which an (asymptotically) optimal lower bound can 
be provided using the smooth rectangle bound, for example Inner Product [21] . Greater-Than, 
Set-Disjointness [25] [20], Gap-Hamming Distance (TJ [27] etc. Our result also implies near optimal 
direct product results for several important functions and relations used to show exponential 
separations between classical and quantum communication complexity, for which near optimal 
lower bounds are provided using the rectangle bound, for example by Raz [24] . Gavinsky [10| 
and Klartag and Regev |26j . In fact we are not aware of any relation for which it is known 
that the smooth rectangle bound does not provide an optimal lower bound. This lower bound 
subsumes many of the other lower bound methods, for example the rectangle bound (a.k.a the 
corruption bound) |25j . the smooth discrepancy bound (a.k.a the 72 bound) |23j which in turn 
subsumes the discrepancy bound, the subdistribution bound [14] and the conditional min-entropy 
bound [12]. 

We show our result using information theoretic arguments. A key tool we use is a sampling 
protocol due to Braverman [5], in fact a modification of it used by Kerenidis, Laplante, Lerays, 
Roland and Xiao [18] , 

1 Introduction 

Given a model of computation, suppose solving one instance of a given problem / with probability 
of success p < 1 requires c units of some resource. A natural question that may be asked is: how 
much resource is needed to solve f k , k instances of the same problem, simultaneously. A naive way 
is by running the optimal protocol for /, k times in parallel, which requires c • k units of resource, 
however the probability of overall success is p k (exponentially small in k). A strong direct product 
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theorem for / states that this is essentially optimal, that is if only o(k ■ c) units of resource are 
provided for any protocol solving f k , then the probability of overall success is at most p^ k \ 

Proving or disproving strong direct product theorems in various models of computation is a cen- 
tral task in theoretical computer science, which have been studied extensively for decades. Readers 
may refer to [20^ [T2"| [To] for a good discussion of known results in different models of computa- 
tion. In the present work, we consider the model of two-party two-way public coin communication 
complexity [30] and consider the direct product question in this model. In this model, there are 
two parties who wish to compute a joint function (more generally a relation) of their input, by 
doing local computation, sharing public coins and exchanging messages. The resource counted is 
the number of bits communicated between them. The textbook by Kushilevitz and Nisan [21] is 
an excellent reference for communication complexity. Much effort has been made towards investi- 
gating direct product questions in this model and strong direct product theorems have been shown 
for many different functions, for example Set-Disjointness [20], [12] , Inner Product [22], Pointer Chas- 
ing [15] etc.; again please refer to [20l[12]|T5] for a good discussion. To the best of our knowledge, 
it is not known if the strong direct product theorem fails to hold for any function or relation in 
this model. Therefore, whether strong direct product theorems hold for all relations in this model, 
remains one of the major open problems in communication complexity. 

Another major focus in communication complexity has been to investigate generic lower bound 
methods, that apply to all functions (and possibly to all relations). In the model we are con- 
cerned with, various generic lower bound methods are known, for example the partition bound [131, 
information complexity [8], the smooth rectangle bound [13] , which in turn subsumes the rectan- 
gle bound (a.k.a the corruption bound) (29] [T] [25] [HJ 0] , the smooth discrepancy bound (a.k.a the 
72 bound) [23] which in turn subsumes the discrepancy bound, the subdistribution bound |14] and 
the conditional min- entropy bound |12j . Proving strong direct product results in terms of these 
lower bound methods is a reasonable approach to attacking the general question. Indeed, many 
lower bounds have been shown to satisfy strong direct product theorems, example the discrepancy 
bound [22], the subdistribution bound under product distributions [H], generalized discrepancy 
bound [28] and conditional min-entropy bound [12] . 

Our result 

In present work, we show a strong direct product theorem in terms of the smooth rectangle bound, 
introduced by Jain and Klauck [13], which generalizes the rectangle bound (a.k.a. the corruption 
bound) [29] [1] [25] [191 SJ- Roughly speaking, the rectangle bound for relation / C X x y x Z 
under a distribution fx, with respect to an element z 6 Z, and error e, tries to capture the size 
(under /i) of a largest rectangle for which z is a right answer for 1 — e fraction of inputs inside the 
rectangle. It is not hard to argue that the rectangle bound forms a lower bound on the distributional 
communication complexity of / under fi. The smooth rectangle bound for / further captures the 
maximum, over all relations g that are close to / under fj,, of the rectangle bound of g under \i. 
The distributional error setting can eventually be related to the worst case error setting via the 
well known Yao's principle [29] . 

Jain and Klauck showed that the smooth rectangle bound is stronger than every lower bound 
method we mentioned above except the partition bound and the information complexity. Jain and 
Klauck showed that the partition bound subsumes the smooth rectangle bound and in a recent 
work Kerenidis, Laplante, Lerays, Roland and Xiao [18] showed that the information complexity 
subsumes the smooth rectangle bound (building on the work of Braverman and Weinstein [6] who 
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showed that the information complexity subsumes the discrepancy bound). New lower bounds for 
specific functions have been discovered using the smooth rectangle bound, for example Chakrabarti 
and Regev's [7J optimal lower bound for the Gap-Hamming Distance partial function. Klauck [20] 
used the smooth rectangle bound to show a strong direct product result for the Set-Disjointness 
function, via exhibiting a lower bound on a related function. On the other hand, as far as we know, 
no function (or relation) is known for which its smooth rectangle bound is strictly smaller than 
its two-way public-coin communication complexity. Hence establishing whether or not the smooth 
rectangle bound is a tight lower bound for all functions and relations in this model is an important 
open question. Our result is as follows. 

Theorem 1.1. Let X,y,Z be finite sets, fCXxyxZbea relation, and t > 1 be an integer. 
For any < e < I, < 6 < 1 with s' d = 25(22e + S) 1 / 3 < 1, it holds that 

c,*. (/,) 4'''( £l ' !ir < ,(,) "?)' 

Our result uniformly implies a strong direct product result for all functions and relations for 
which optimal lower bound can be shown using the smooth rectangle bound, which includes many 
well-studied problems, example Inner Product [21], Greater-Than, Set-Disjointness[2U EQ] and Gap- 
Hamming Distance^ [27] etc. Our result also implies near optimal direct product results for sev- 
eral important functions and relations used to show exponential separations between classical and 
quantum communication complexity, for which near optimal lower bounds are provided using the 
rectangle bound, for example by Raz [23], Gavinsky [10] and Klartag and Regev |26j. 

Our techniques 

The broad argument of the proof of our result is as follows. We show our result in the distributional 
error setting and translate it to the worst case error setting using the well known Yao's principle 
[29j . Let / be a relation, ^ be a distribution on X x y, and c be the smooth rectangle bound of / 
under the distribution [i with output z € Z. Consider a protocol II which computes f k with inputs 
drawn from distribution n k and communication o(c • k) bits. Let C be a subset of the coordinates 
{1,2,... ,k}. If the probability that II computes all the instances in C correctly is as small as 
desired, then we are done. Otherwise, we exhibit a new coordinate j £ C, such that the probability, 
conditioned on success in C, of the protocol II answering correctly in the j-th. coordinate is bounded 
away from 1. Since \x could be a non-product distribution we introduce a new random variable Rj, 
such that conditioned on it and XjYj (input in the jth. coordinate), Alice and Bob's inputs in the 
other coordinates become independent. Use of such a variable to handle non product distributions 
has been used in many previous works, for example [2J [ITJ El H2J [15] . 

Let the random variables XjY^RjM 1 represent the inputs in the jth coordinate, the new 
variable Rj and the message transcript of II, conditioned on the success on C. The first useful 
property that we observe is that the joint distribution of XjYjRjjM can be written as: 

Pr[Xj Y^RjM 1 = xym] = -/j,(x,y)u x (rj,m)u y (rj,m), (1) 
x For a relation / with R? ub (/) = 0(1), a strong direct product result can be shown via direct arguments [12j . 
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where u x ,u y : M. — > [0, 1] are functions. The marginal distribution of XjYl is no longer fi though. 
However (using arguments as in [IS]), one can show that the distribution of XjYj is close, in i\ 

distance, to /i and l(xj : R)M l Y^j + l(Y/ : R)M l xfj < o(c). 

Now, assume for contradiction that the success in the jth coordinate in n is large, like 0.99, 
conditioned on success in C. Using the conditions obtained in the previous paragraph, we argue 
that there exists a zero-communication public-coin protocol n', between Alice and Bob, with inputs 
drawn from \i. In n' Alice and Bob are allowed to abort the protocol or output an element in Z. 
We show that the probability of non-abort for this protocol is large, like 2 _c , and conditioned on 
non-abort, the probability that Alice and Bob output a correct answer for their inputs is also large, 
like 0.99. This allows us to exhibit (by fixing the public coins of n' appropriately), a large rectangle 
(with weight under [i like 2 _c ) such that z is a correct answer for a large fraction (like 0.99) of the 
inputs inside the rectangle. This shows that the rectangle bound of /, under /i with output z, is 
smaller than c. With careful analysis we are also able to show that the smooth rectangle bound of 
/ under /x, with output z, is smaller than c, reaching a contradiction to the definition of c. 

The sampling protocol that we use to obtain the public-coin zero communication protocol, is 
the same as that in Kerenidis et al. [18], which in turn is a modification of a protocol due to 
Braverman [5^1 (a variation of which also appears in [6]). However our analysis of the protocol's 
correctness deviates significantly in parts from the earlier works [18} EJ [6] due to the fact that for 
us the marginal distribution of X x Y l need not be the same as that of //, in fact for some inputs 
(x,y), the probability under the two distributions can be significantly different. There is a nice 
property of the protocol n' that we observe which turns out to be very crucial in our arguments. 
The property is that the bad inputs (x,y) for which the distribution of n"s sample for RjM 1 , 
conditioned on non-abort, deviates a lot from the desired R X M 1 \ (X 1 Y 1 = xy), their probability 
is nicely reduced (as compared to Pr [A^Y 1 = xy\) in the final distribution of n', conditioned on 
non-abort. This helps us to argue that the distribution of inputs and outputs in n', conditioned 
on non-abort, is close in l\ distance to XjYj R^M 1 , implying good success in n', conditioned on 
non-abort. 



2 Preliminary 

Information theory 

We use capital letters X, Y, Z ■ ■ ■ or letters in bold a, b, a, /3 ■ ■ ■ to represent random variables and 
use calligraphic letters X,y,Z, - ■ ■ to represent sets. For integer n > 1, let [n] represent the set 
{1, 2, . . . , n}. Let X, y be finite sets and k be a natural number. Let X k be the set X x • • • x X, 
the cross product of X, k times. Let /i be a (probability) distribution on X. Let fi(x) represent 

def 

the probability of x € X according to [i. For any subset S C X, define [i(S) = SzesM^)- Let 
X be a random variable distributed according to /i, which we denote by X ~ /i. We use the same 
symbol to represent a random variable and its distribution whenever it is clear from the context. 

The expectation value of function / on X is denoted as E x .4_x [/(a^)] = f SxeA , ^ >r [^ = x ] ' f( x )- 

The entropy of X is defined as H(A) = f — Y2 X ^{ x ) ' logM^)- For two distributions /i, A on X, the 

distribution fi <g> A is defined as (fj, (g) X)(xi,X2) = f fJ-(x±) ■ X(x2). Let fi h d = [i ® ■ ■ ■ ® fj,, k times. 

2 A protocol, achieving similar task, however working only for product distributions on inputs was first shown by 
Jain, Radhakrishnan and Sen |17| . 
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def 

The i\ distance between \x and A is denned to be half of the l\ norm of [i — A; that is, ||A — /i||i = 
1 Ylx \^( x ) ~ / u ( x )l = max scx |A(5) — n(S)\. We say that A is e-close to \i if ||A — < e. The 



log 



2 t—ix 

relative entropy between distributions X and Y on X is defined as S(X||Y) = E^^x iu 5 Pl y_ x 

The relative min-entropy between them is defined as Soo(X||Y) *^= f max x6 ^ |log p^y~l^j | • It is 

easy to see that S(X|| Y) < S 00 (X||y). Let X, Y, Z be jointly distributed random variables. Let Y x 
denote the distribution of Y conditioned on X = x. The conditional entropy of Y conditioned on X 
is defined as H(Y|X) d = E^ ^H^)] = H(I7) - H(X). The mutual information between X and 
Y is defined as: l(X : Y) d = E(X) + H(Y) - H(XY) = E^y [S(XJX)] = E a . < _ x [S(y ai ||Y)] . The 

conditional mutual information between X and Y, conditioned on Z, is defined as: 1(X : Y\Z) = 
E Z ^ Z [I(X : Y|Z = «)] = H(X|Z) + H(Y|Z) - H(XY|Z) . The following c/wm raZe for mutual 
information is easily seen : I(X : YZ) = I(X : Z) + I(X : Y\Z) . 

We will need the following basic facts. A very good text for reference on information theory 

isie]. 

Fact 2.1. Relative entropy is jointly convex in its arguments. That is, for distributions fjr, A, A 1 € 
* andpe [0,1]: S(p/i + (1 -p)/i 1 ||A + (1 -p)A 1 ) <p-S(H|A) + (1 -p) • S^A 1 ) . 

Fact 2.2. Relative entropy satisfies the following chain rule. Let XY and X 1 Y 1 be random 
variables on X x y. It holds that: S(X 1 Y 1 ||XY) = S(X 1 ||X) +E x ^ x i [S (Y^Hy^] . In particular, 
S(X 1 Y 1 \\X ®Y) = S(X 1 \\X) + E !E <_ x i [S(YJ||Y)] > S(X 1 ||X) + S(Y 1 ||Y) . The last inequality 
follows from Fact 12.11 

Fact 2.3. Let XY and X 1 Y 1 be random variables on X x y. It holds that 
S(X 1 Y 1 \\X ®Y) > S(X 1 Y 1 ||X 1 (g) Y 1 ) =I(X 1 : Y 1 ) . 

Fact 2.4. For distributions A and fj,: < ||A — fi^ < y / S(X\\fi). 

Fact 2.5. [16] Let X, X' be two distributions on X. For any 5 € (0, 1), it holds that 



Pr 

xi-X' 



Pr[X - x] < 2 ( S (x'||X)+i)/5 



> 1 - 5. 



Pr[X = 

Fact 2.6. Given random variables XY and X'Y' on X X 3^, it holds that 

^[S(Yj||Y)] >^ ; [S(Yj||Y')] =I(X':Y'). 

Proo/. 

e^ ; [s(y;||y)] - e [s(y^||y')] = s{x'y'\\x'® y) -s(x'y'\\x'® y') >o. 

The equality holds by Fact 12.21 And the inequality follows by Fact 12.31 

□ 

We will need the following lemma for our later proofs. 
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Lemma 2.7. Given random variables A, A' and e > 0, if \\A — A'W^ < e, then for any r G (0, 1), 



Pr 

a-t— A 



Pr [A' = a] 



£ 

< - 

r 



> 1 — 2r; and Pr 

a<-A' 



Pi[A = a] 

Proof. Let G = [a : |l - < f }, then 



Pr [A' = a] 



Pr [A = a] 



£ 

< - 



> 1 - 2r - e. 



2e>Y^ | Pr [-A = a] - Pr = a] | > ^ \Pr[A 

a agG 



Pr [A' 



1 



Pr [4' = a] 



> Pr [a 0G] • -. 

a<— A r 



Pr [A = a] 

Thus Pr a ^_ J 4[a £ G] > 1 — 2r. The second inequality follows immediately. □ 
Communication complexity 

Let X,y,Z be finite sets, / C X x y x Z be a relation and e > 0. In a two-way public-coin 
communication protocol, Alice is given x € X, and Bob is given y £ y. They are supposed to 
output z € Z such that (x, y, z) 6 / via exchanging messages and doing local computations. They 
may share public coins before the inputs are revealed to them. We assume that the last [log \Z\~\ 
bits of the transcript is the output of the protocol. Let R,£ Ub (/) represent the two-way public-coin 
randomized communication complexity of / with the worst case error e, that is the communication 
of the best two-way public coin protocol for / with error for each input (x, y) being at most e. 
Let \x be a distribution on X x y. Let Dg (/) represent the two-way distributional communication 
complexity of / under distribution [i with distributional error e, that is the communication of the 
best two-way deterministic protocol for /, with average error over the distribution of the inputs 
drawn from fi, at most e. Following is Yao's min-max principle which connects the worst case error 
and the distributional error settings, see. e.g., [21} Theorem 3.20, page 36]. 



Fact 2.8. [30] RT b (./0 = max M Bf(/). 

The following fact can be easily verified by induction on the number of message exchanges in 
a private coins protocol (please refer to [5] for an explicit proof). It is also implicit in the cut and 
paste property of private-coins protocol used in Bar-Yossef, Jayram, Kumar and Sivakumar [2]. 

Lemma 2.9. For any private-coin two-way communication protocol, with input XY ~ /x and 
transcript M , the joint distribution can be written as 

Pr[XYM = xym] = n(x,y)u x (m)u y (m), 
where u x : M — > [0, 1] and u y : M — > [0, 1], for all (x, y) € X x y. 
Smooth rectangle bound 

Let / C X x y x Z be a relation and e, 5 > 0. With a slight abuse of notation, we write 
f(x,y) d = {z G Z\ {x,y,z) G /}, and f~ l {z) d = {(x,y) : (x,y,z) G /}. 
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Definition 2.10. [13| (Smooth-rectangle bound) H The (e, <5)-smooth-rectangle bound of /, 
denoted by srec e5 (/), is defined as follows: 

srec £(5 (/) = f max{srec A s (f ) | A a distribution over X x y}; 
srec e,«5(/) = max{srec^(/) | z G Z}; 

srec £ % A (/) = max{rec^ A (<7) \ g C X x y x Z; Pr [f(x,y) + g(x,y)] < 5,es S x g (z) > e}; 

essg (z) d = Pr [g{x,y) = {z}] ; 
(x,j/)<-A 

rec z £ ' x (g) d = min^ (A fl ||A) | R is a rectangle in X x y, X(g~ 1 (z) n i?) > (1 -e)A(-R)}. 

When 5 = 0, the smooth rectangle bound equals the rectangle bound (a.k.a. the corruption 
bound) \29\ HJ \25\ \19\ H]. Definition 12.101 is a generalization of the one in [13], where it is only 
defined for boolean functions. The smooth rectangle bound is a lower bound on the two way public 
coin communication complexity. 

Lemma 2.11. Let fCXxyxZbea relation and e > 0. Then, 

4 

R £ 2 /8 (/) > srec ££2/8 (/) - log-. 

Proof. Let c d = srec £ e2/ / 8 (/)- Let X,z,g be such that recf' A (g) = c, ess A (z) > e and 

Pr [/(*,!/) # 5 (x,j/)] <e 2 /8- 
(x,i/)«-A 

If D A 2/ / 8 (/) > c — log(4/e) then we are done using Fact 12.81 

So lets assume for contradiction that D A 2 y 8 (/) < c — log(4/e). This implies that there exists a 
deterministic protocol ir for / with communication c — log(4/e) and distributional error under A 
bounded by e 2 /8. Since Pr^ ^^^ [f(x, y) ^ g(x,y)] < e 2 /8, the protocol tt will have distributional 
error at most e 2 /4 for g. Let M represent the message transcript of tt and let O represent protocol's 
output. We assume that the last [log \Z\~\ bits of M contain O. We have, 

1. Pr m< _7\./ [Pr [M = m] < 2 _c ] < e/4, since the total number of message transcripts in ir is at 
most 2 c - lo s( 4 / £ ). 

2. Pr m ^_A/[0 = z\ M = m] > e — £ 2 /4, since esSg(z) > £ and distributional error of ir under A is 
bounded by e 2 /4 for g. 

3. Pr m <-M[PY(x,y)<-(XY\M=m)K x ,y,0) £ g\ M = m] > 1 — e] > 1— e/4, since distributional error 
of 7r under A is bounded by e 2 /4 for g. 

Using all of above we obtain a message transcript m such that Pr[M = m] > 2~ c and (O = z\ M = 
m) and 

Pr [(x.y.O) € g\ M = ml > 1 - e. 



3 The function esSg(-) was introduced by Jain in P2], where it was defined over any subset in Z. A similar ex- 
tension of our definition and the subsequent results can also be given in the same manner. For simplicity, we just 
consider singleton subsets in Z. 
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This and the fact that the support of (XY\ M = m) is a rectangle, implies that reCe' A (g) < c, 
contradicting the definition of c. Hence it must be that D^ 2 / 8 (/) > c — log(4/e), which using 
Fact 12.81 shows the desired. 

□ 

3 Proof 

The following lemma builds a connection between the zero-communication protocols and the smooth 
rectangle bound. 

Lemma 3.1. Let f C X x y x Z and X'Y' be a distribution over X x y. Let c,e,5 > with 
25(2e + 5) 1 / 3 < 1. Lei II 6e a zero- communication public-coin protocol with input X'Y', public 

coin R, Alice's output A G Z U {_L}, and Bob's output B G Z U {!}. Lei X 1 !" 1 ^ 1 ^ 1 ^ 1 = f 
(X'y'ABiZj A = B ^ ±). Let 

1. Pr[A = B / _L] > 2~ c . 

s. - z'y'H < e. 

3. Prf^Y 1 ,^ 1 ) € /] > 1 -£. 
T/ien 

Sre 4'£+ 5 )V3 )5 (/) < 5(2£ + 5)1/3 + lQ S • 

Proo/. Let ei = f 25(2e+<5) 1 / 3 . Let <? C A"x^x2, and z G £ satisfy Pr( s ^.^ x / F /[/(a;, y) 7^ y)] < 
5 and essf Y> (z) >e\. It suffices to show that rectf' Y \g) < ff + 3 log f^. Since PrL4 = B / _L] > 
2" c , 

> s^y^jlx'y'i?) 

> E [S((X 1 y 1 ) r ||X'y / )] (from Fact UM- ( 2 ) 

■lvl V'V' 



Since ^Y 1 - X'Y'\\ < e, 

n, bO 6 ' y ) = - p £, v , [5(3;, y) = {z}] - e 

xyr-^X 1 Y 1 B 1 xy-^X'Y' 

24 

- ei_£> 25 ei " ^ 

Since Pr[(X 1 , Y 1 , A 1 ) G /] > 1— s, and P^( x ,y)-^x'Y'[f( x ^ u) 7^ 9(x, y)\ < 5, by item 2 of this lemma, 
we have 

Pr \(x,y,a)eg}> Pr [(x, y, a) G /] - 5 - £ > 1 - 2e - 5. (4) 



S 



By standard application of Markov's inequality on equations (|2J), fl3J), (@1), we get an ro, such that 

S((X^) ro \\x'Y')<^, (5) 
Pr [g(x,y)^{z})<l-12 Sl /25, and (6) 

Pr [(x,y,a)eg]> l-^ = i -A- ( 7 ) 

xya<-{X 1 Y 1 A 1 ) rQ E\ 3125 

Note that the distribution of (X 1 Y 1 ) ro is the distribution of X'Y' restricted to some disjoint rect- 
angles and then rescaled to make a distribution. Hence S((X 1 Y 1 ) n) \\X'Y'^ = S oo ((X 1 y 1 ) r0 || A'y') . 

Let X 2 Y 2 d = (X'Y'\ R = r ,A = B = z). Further note that the distribution of X 2 Y 2 is the 
distribution of X 1 Y 1 restricted to a rectangle and then rescaled to make a distribution. Now from 
© and ©, we have (X^ll^y 1 ) < log < log f . Hence, 



25 3125 



{X 2 Y 2 \\X'Y f ) < S OD (X 2 Y 2 \\X 1 Y 1 ) +S 00 (X 1 Y 1 \\X'Y' 

(3 \ 5c 
log— +— . (from©) 



Also, 



Thus rec^' y ' (o) < ^ + log < ^ + 3 log 25. 

□ 

The following is our main lemma. A key tool that we use here is a sampling protocol that 
appears in [18] (protocol n' as shown in Figure [1]), which is a variant of a sampling protocol that 
appears in [6], which in turn is a variant of a sampling protocol that appears in [5]. Naturally similar 
arguments and calculations, as in this lemma, are made in previous works [51[6j[T8], however with 
a key difference. In their setting ^ m u x {m)u y {m) = 1 for all (x,y). However in our setting this 
number could be much smaller than one for different (x,y). Hence our arguments and calculations 
deviate from previous works at several places significantly. 

Lemma 3.2 (Main Lemma). Let c > 0, < <5 < 1, < e < 1 be real numbers with 25(22e + <5) 1 / 3 < 
1. Let XYM be random variables jointly distributed over the set X x y x M such that the last 
[log | Z |] bits of M represents an element z £ Z. Let p be a distribution over X x y. There exist 
u x : M -> [0, 1] and u y : M -> [0, 1], for all (x, y) £ X xy. Lf it holds that, 

1. Pr [XYM = xym] = ^p(x,y)u x (m)uy(m), for all (x,y,m) € X x y x M., where q = f 
T,xymP{ x ^y) u x{m)u y {m); 

2. spry ||p) < e 2 /4 ; 

3. \{X : M\Y)+l{Y : M\X) < c; 
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def 



4- err f(XY M) < e, where eiif(XYM) = Pi xym ^xYM[(x,y,rh) 6 /], and rh represents the last 
[log \Z\~\ bits of m; 

then srec^ (22£+5)1/3 (5 (/) < f±|. 

Proof. Note by direct calculations, 

PrpTY = xy] = -p(x, y)a xy , where a xy = f ^u x (m)u y (m); 

m 

PrpT = x] = -p(x)aa;, where a x = f y^p(y|: 

1 dcf x 

Pr[Y = y] = -p(y)a y , where = ^p(x|y)c 



XJ/J 



Pv[X y = x] 
Pr[Y x = y] 



p(x\y)a 



xy . 



a 

p(y\x)a x . 



xy . 



a 



Pr[M xy = m}= u x (m)u y (m)/a xy ; 

^ r, r 1 u x (m) 
Pr[M x =m}- v ; 



a. 



^p(y|x)%(m) 



u x (m)v x (m) 



def 



where f x (m) = X] y p(z/|x)M y (m); 



(8) 
(9) 

(10) 

(11) 

(12) 
(13) 

(14) 
(15) 



Define 



Pr[M, 



TO 



^P{x\y)u x {m) 



u y (m)v y (m) 



a. 



where v y (m) d = ^2 X p{x\y)u x (m); 



def 



< - and 



ol x 
1 



< — and 
~ 2 



G 2 = {(x,y) : S(M^||M X .) + S(Af xy ||M w ) < c/e} 

u,,(m) a 
^ v ' < 2 A and 



3^(x,y): Pr 



v x (m) 



> 1 - 2e 



1 

< - 
~ 2 



We begin by showing that G\ n G 2 is a large set and also G\ n G 2 C G. 
Claim 3.3. 1. Pr (a . )y) ^ p [(x, y) S GJ > 1 - 6e, 

2. Pr M ^[(x,|/)6G 2 ] > l-3e/2, 

3- Pr (X)1 ,)«_p[(x,y) € Gi nG 2 ] > 1 - 15e/2, 

4. Gi n G 2 C G. 



(16) 



(17) 

(18) 
(19) 
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Alice's input is x. Bob's input is y. Common input is c,e,A4. 



1. Alice and Bob both set A d = ^f±i + 2, T d = § |M|2 A In \ and k = log(f (In §)). 

2. For % = 1, ••• ,T : 

(a) Alice and Bob, using public coins, jointly sample mj <— A4,oti,f3i ^— [0,2 A ], uniformly. 

(b) Alice accepts mj if cti < ^(m;), and (3i < 2^v x (m\). 

(c) Bob accepts mj if ctj < 2 A v y (m.i), and (3i < u y (uii). 

3. Let A = f {i € [T] : Alice accepts mi} and B d = {i £ [T] : Bob accepts mi}. 

4. Alice and Bob, using public coins, choose a uniformly random function h : M —> {0, l} k and 
a uniformly random string r € {0, l} fc . 

(a) Alice outputs _L if either A is empty or h(mi) ^ r (where i is the smallest element 
in non-empty A). Otherwise, she outputs the element in Z, represented by the last 
[log \Z\~\ bits of rrii. 

(b) Bob finds the smallest j € B such that h(mj) = r. If no such j exists, he outputs _L. 
Otherwise, he outputs the element in Z, represented by the last [log \Z\~\ bits of mj. 



Figure 1: Protocol IT 



Proof. Note item 1. and item 2. imply item 3. Now we show 1. Note that \\XY — p\\i < e/2. From 
Lemma 12.71 and , we have 



Pr 

{x,y)^p 



(J 



< 1/2 



> 1 - 2e. 



By the monotonicity of £i-norm, we have \\X — px\\\ < § an d ||A — py\\i < §. Similarly, from 
and (PTOl) we have 



Pr 

(x,y)<-p 



&x 
1 



< 1/2 



> 1 - 2e, and Pr 

(x,y)^p 



i _ 



< 1/2 



> 1 - 2e. 



By the union bound, item 1. follows. 

Next we show 2. From item 2 of Lemma 



E [S(M xy \\M x ) + S(M xy \\M y )] = l(X : M\Y) +I(Y : M\X) < c. 

{x,y)^XY 

Markov's inequality implies Pr^ y ^ X Y [( x > u) £ ^2] > 1 — £. Then item 2. follows by the fact that 
XY and p are e/2-close. 
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Finally we show 4. For any (x, y) GGifl G2, 
S{M xy \\M x ) <c/e 



Pr 

m-<—RI X y 

Pr 



Pr 

m<— M x 



< 2— 



Pr [M xy = m] ^ £/£±i 
Pr [M x = m] 
u y (m)a 



> 1 - e (from Fact [23 



< 2" 



Similarly, 



By the union bound, 



■yyilL)UL X 

v x (m)a xy 

u y( m ) < 2 A 
v x (m) 



Pr 

m-<—M x 



> 1 - e (from CE} and (fill ) 



> 1 — e. ((x, 1/) € Gi and the choice of A) 

u x (m) 
v y (m) 



< 2 L 



> 1 -s. 



Pr 

mi—Mx 



u v (m) A , u x (m) a 
y ; ( < 2 A and ; ( < 2 A 
v x (m) Vy(m) 



> 1 - 2e, 



which implies (x, y) € G. Hence G\ fl G2 C G. 



□ 



Following few claims establish the desired properties of protocol II' (Figured]). 
Definition 3.4. Define the following events. 

• E occurs if the smallest i € A satisfies h(nii) = r and % € B. Note that E implies *4 7^ 0. 

• B c (subevent of E) occurs if E occurs and there exist j G B such that h(mj) = r and mi 7^ mj, 
where i is the smallest element in A. 

dof 

• H = - E- B c . 

Below we use conditioning on (x,y) as shorthand for "Alice's input is x and Bob's input is y" . 

Claim 3.5. For any (x,y) € G\ n G2, we have 
1. for all i e [T], 



q 3 
< Pr [Alice accepts nii| (x,y)] < 



and 



1 

2' |A^|2 A 

1 9 



2 |A4|2 A ' 



< Pr[Bob accepts mj| (x,y)] < — 



2 |A^|2 A -r n , 
where rn' is the internal randomness of protocol II'; 

2. Pr rn ,[5 c | (x,y),E]<e; 

3. Pr rn , [H\ (x,y)} > (1 - 4e) • 2~ k ~ A - 2 . 



2 |A4|2 A ' 
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Proof. 1. We do the argument for Alice. Similar argument follows for Bob. Note that u x (m), v x {m) £ 
[0, 1]. Then for all (x,y) e X x y, 

^ r a ! i / m 1 u x (m)v x (m) a x 
ft [Alice accepts mi | (*,y)] = W L = j^- 

Item 1 follows by the fact that £ Gi. 

2. Define £7, (subevent of E) when i is the smallest element of A. For all (x, y) £ Gi D G2, we 
have : 

Pr[5 c | (x.y),^] 
r n' 

= Pr[3j : j £ B and h(rrij) = r and m; 7^ m;| (x,y),i?j] 
r n' 

< Pr[i £ B and h(mj) = r and mj 7^ rrii| (from the union bound) 

je[T]j^i rn ' 

< V Pr[j £ B\ (x,y),Ei] • Pr[h(mj) = r| (x,y),E h j £ S,mj ^ m;] 

r n' r n' 

3g 1 

— ^ ' TTTTaZi ' ttet (using two-wise independence of h and item 1. of this Claim) 
|.M|2 £i+1 2 K 

< e. (from choice of parameters) 

Since above holds for every i, it implies Pr rn/ [B c \ (x,y),E] < e. 

3. Consider, 

Pr [E\ (x, y)} = Pr [A + 0\ (x, y)} • Pr [E\ A + 0, (x, y)} 

r n , r n , r n , 

> I 1 - (l - i • — ^ . ^ I • PrLEl ^ 7^ 0, (x,y)l (using item 1. of this Claim) 

\ V 2 I-m|2 a ; y r n , 

> (1 — e) ■ PrLEl A 7^ 0, (x,y)] (from choice of parameters) 

r n' 

= (1 - e) ■ Pr[h(mi) = r| A 7^ 0, (x,y)] Pr[i £ B\ i £ A, h(mi) = r, (x,y)] 
r n' r n' 



(l-e)-2- 



(from here on we condition on i being the first element of A) 
\B\iEA, (x,y)} 

k Pr rn , [i £ B and i £ A\ (x, y)] 



(l-e)-2- fc -Pr[i£i3|i£A (x,y)} 
r n' 



Pr rn , [i £ A\ (x, y)\ 



2 

> — (1 - e) ■ 2~ k ■ \M\2 A • Prfi £ B and i £ .41 (using item 1. of this claim) 

3<7 r n , 

= ^(l-e)-2- fc -|A^|2 A . j^^rnm{u x (m),2 A v y (m)}-mm{u y (m),2 A v x (m)} 

(from construction of protocol II') 
2 n „\ o-fc i a^ioA M a; (m)uj / (m) 



>-(l-e).2- fc .|M|2 A . ]T 



13 



(where G xy = f {m : u x (m) < 2 A v y (m) and u y (m) < 2 A v x (m)}) 



2 , . i, ... i a a™ v — v u r (m)u v (m) 
(1 -,)•!!-»• -pfj-jk £ _-U-l 

1 



> ^(1 - e) ■ 2" fc - A • Pr [m 6 G«,] (since (x, y) G Gi and CE])) 

3 m<r~M xy 

> -(1 - e) ■ 2~ k ~ A • (1 - 2e) (since (x, y) G G, using item 4. of Claim [33 

> (1 - 3e) • 2- fc - A ~ 2 . 

Finally, using item 2. of this Claim. 

Px[H\ (x,y)]=Pr[E\ (x,y)](l-Pi[B c \ (x,y),E}) > (1 - 4e) • 2 

r n' r n' r n' 



-jfc-A-2 



□ 



Claim 3.6. Pr p>rn , [H] > (1 - f e) • 2~ fc - A ~ 2 . 
Proof. 

23 

Pr[tf] > ^ p(x,y)HH\ (x,y)} > (l-4e)-2"^ A " 2 £ p(x,y) > (I- £ ). 2 - k ~ A ~ 2 . 

(x,j/)eGinG* 2 (x,y)GGinG 2 

The second inequality is by Claim [3~3| item 3, and the last inequality is by Claim [3731 item 3. □ 

The following claim helps us establish a crucial property of IT. The property is that the bad 
inputs (x, y) for which the distribution of n"s sample for M, conditioned on non-abort, deviates a 
lot from the desired, their probability is nicely reduced in the final distribution of IT, conditioned 
on non-abort. This helps us to argue that the joint distribution of inputs and the transcript in n', 
conditioned on non-abort, is still close in t\ distance to XYM. 

Claim 3.7. Let AB and A'B' be random variables over A\ x B\ and h : A\ — > [0, +oo) be a 
function. Suppose for any a G A\, there exist functions f a ,g a '■ B\ — > [0, +oo), such that 

1- Z~2 a ,b h (a)fa(b) = I, and Pv[AB = ab] = h(a)f a (b); 

2. f a (b) > g a (b), for all (a,b) e Ai x B i; 

3. Pt[A'B' = ab] = h(a)g a (b)/C, where C = ^ b h(a)g a (b); 

4. Pr a ^[Pr 6 ^ Sa [f a (b) = g a (b)] > 1 - <5J > 1 - <5 2 , for ^ G [0, 1),<5 2 G [0, 1). 
Then \\AB - A'B'\\ l < 5i + 5 2 . 

Proof. Set G d = {(a, b) : f a {b) = g a {b)}. By condition 4, Pr (a , b) ^ AB [(a, b) G G] > 1 - 8 X - 5 2 . Then 
C = Y j h{a)g a {b) > V h{a)f a {b)= Pr [(a, 6) G G] > 1 - 8 X - S 2 . (20) 

a,b a,b:(a,b)£G 
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We have 

\\AB - A'BX = l -Y. \Ko)fa(b) - ^h(a)g a (b)\ 

a,b 



< \ E (\ h (o)fa(b) ~ h(a)g a (b)\ + \h(a)g a (b) - ^h(a)g a (b 

a,b ^ 

1 / - 1 — C V \ 

< 2 I XI ( h< y a )fa{b) - h(a)g a {b)) H — h(a)g a (b) J (using item 2. of this claim) 

\ a,b a,b J 



< - 

~ 2 



\\ Yl Ka)f a (b) + l-C 

\a,b:{a,b)(£G 

1 ( Pr [(a, b) i G] + 1 - C ) < 5 1 + 5 2 . (from (H 

2 \(a,6)<-AB 



□ 



Claim 3.8. Let the input of protocol LT' be drawn according to p. Let X 1 Y 1 M 1 represent the 
input and the transcript (the part of the public coins drawn from A4) conditioned on H. Then we 



have \\XYM - X^M 1 ^ < We. Note that this implies that 



X 1 Y 1 A 1 B 1 - XYMM 



< We, 

l 

where M represents the last [log \Z\~\ bits of M and A 1 ^ 1 represent outputs of Alice and Bob 
respectively, conditioned on H . 



Proof. For any (x,y), define 



w xy {m) = f min ju x (m), 2^v y (m)} • min \u y {rn), 2 A f x (m)} . 
From step 2 (a),(b),(c), of protocol IT', Pr [M 1 X 1 Y' 1 = mxy] = ^p(x,y)w xy (m), where C 



Pr 

(x,y)t-XY 



Pr [w xy (m) = u x (m)u y (mj\ > 1 - 2e 

m<—M xy 



Pr \(x, y) G G] > 1 - 8e. 

(x,y)^XY 



The last inequality above follows using items 3. and 4. of Claim 13.31 and the fact that XY and p 
are e/2-close. 

Finally using Claim O (by substituting A <- XY,B <r- M, A' X l Y l ,B' <- M l ,h <- 
g>/(x,j/)( m ) u x {rn)u y {m) and g( x ^ y ){m) <— w xy (m)), we get that H^l^M 1 — IFM^ < lOe. 

□ 

We are now ready to finish the proof of Lemma [3.21 Consider the protocol II'. We claim that it 
satisfies Lemma f3.1l by taking the correspondence between quantities in Lemma [3. II and Lemma l3.2l 
as follows : c <— (c/e 2 + 3/e),e <— lie and X'Y' distributed according to p. 

Item 1. of Lemma O is implied by Claim ESI since (1 - f e) ■ 2~ k ^ A ~ 2 > 2^( c / £2 + 3 / £ ), from 
choice of parameters. 

Item 2. of Lemma O is implied since \\X^ < \\X^ 1 -17^ + \\XY - p\^ < ^-e, 

using item 2. in Lemma |3.2[ Fact 12.41 and Claim [3781 
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Item 3. of Lemma EE]is implied since err f (X 1 Y l M l ) < erv f (XYM) + WX^M 1 - XYM^ < 
lie, using item 4. in Lemma 13.21 and Claim [3781 
This implies 

v , ^ c / e2 + 3 / e , , 1 c + 3 



□ 



Theorem 3.9. Let X,y,Z be finite sets, f Q X x y x Z be a relation, and t > 1 be an integer. 
For any < e < 1, < S < 1 w'i/i e' d = 25(22e + J) 1 / 3 < 1, it holds that 



Proof. Set <5i = f e 2 /32. Given distribution ^lonA'xy, define c = e 1 / 3 srec^, s (f) — and XY ~ /A 
By Fact 12.81 it suffices to show 

Let LT be a deterministic two-way communication protocol, that computes / , with total com- 
munication 5\ct bits. The following claim implies that the success of IT is at most (1 — e)L 5l *J, and 
this shows the desired. 

□ 

Claim 3.10. For each i € [t], define a binary random variable Tj € {0,1}, which represents the 
success of Q on the z'-th instance. That is, Tj = 1 if the protocol computes the z'-th instance of / 
correctly, and Tj = otherwise. Let t' = f [5±tj. There exists t' coordinates ■ ■ ■ ,if} such that 
for each 1 < r < if — 1, 

1. either Pr [T^ = l] < (1 - ef or 

2. Pr [T ir+1 = 1| rW = 1] < 1 - e, where T« d = 7] -=i ?V 

Proof. Suppose we have already identified r coordinates, z'i, • • • ,i r satisfying that Pr [T^] < 1 — e 
and Pr [T ij+1 = 1| T^) = l] < 1 - e for 1 < j < r - 1. If Pr [T^ = l] < (1 - e)*', then we are 
done. So from now on we assume Pr [T^ = l] > (1 — e) 1 > 2~ Slt . Here we assume r > 1. Similar 
arguments also work when r = 0, that is for identifying the first coordinate, which we skip for the 
sake of avoiding repetition. 

Let D be a random variable uniformly distributed in {0, 1} and independent of XY. Let 
Ui = Xi if Di = 0, and Ui = Y^ if Di = 1. For any random variable L, define L 1 = (L\T^ = 
1). If L = L\---Lt, define L_j = L\ ■ ■ ■ L^iL^i ■ ■ ■ Lf. Let C = f {ii,-- - , i r } ■ Define Ri = f 

D-iU-iXc\j[i~i]Yc\j[i-i]- 

Now let us apply Lemma [3721 by substituting XY <— XjY^M <- RjM 1 ,e <— e and c <h- 
16(5i(c+l). Condition 1. in Lemma l3.2l is implied by Claim 13.111 Conditions 2. and 3. are implied 
by Claim I37T21 Also we have srec 25(22 £ +<5) 1 /3 ,$(/) ^ 16 ^ 1 ^j 3 1 ^ +3 , by our choice of c. Hence condition 
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4. must be false and hence errj (^XjY^M 1 ^ = erry yXj Y^RjM^J > e. This shows condition 2. of 
this Claim. 

□ 

Claim 3.11. Let TZ denote the space of Rj . There exist functions u X j , Uy^ : TZ x M ->■ [0, 1] for all 
{xj,yj) S X x y and a real number q > such that 

Proof. Note that is independent of Rj. Now consider a private-coin two-way protocol IIi 

with input XjYj as follows. Let Alice generate Rj and send to Bob. Alice and Bob then generate 
(X-j) x . r . and (Y-j)y jrj , respectively. Then they run the protocol II. Thus, from Lemma 12.91 

Pr[XjYjRjM = xyjrm] = fj,(xj,yj) ■ v Xj (rj,m) ■ v yj (rj,m), 

where v Xj ,v y . : M. — > [0, 1] , for all (xj,yj) G X x y. 

Note that conditioning on = 1 corresponds to choosing a subset, say S, of TZ x Ai. Let 



def 



^2 ii\.v r !) J )c J .ir J .ni)r !i (r J .ni) . 

'j Vj rjm:(rj,m)£S 



Then 

Pr [ X j Y j R j M = x jyj r j m ] = -^{xjiUj^xj (rj,m)v yj (rj,m), 

for (rj,m) € 5 and Pr XjY^RjM = XjyjTjm = otherwise. 
Now define 

u Xj (rj , m) = v Xj (rj , m) , and u Vj (rj,m) = v Vj (rj,m), 
for (rj,m) € 5 and define them to be otherwise. The claim follows. □ 

Claim 3.12. If Pr [T^ = l] > 2~ Slt , then there exists a coordinate j (£ C such that 



2 



S (Xj Y 3 l \\XjYj) <Ux = —, and 

l(X) : M 1 R 1 j \Y J 1 ) +I(y/ : M 1 R)\xj) < 165 1 (c+ 1). 

Proof. Following exactly as in the proof of Claim III. 6 in [15], there exists a coordinate j ^ C, such 
that 

H x M\\ x i Y i) ^ 8S ^ 

< E Rl ^[ S (( y /)^H^^)]+ , E t S ((4)^11 (^) % )] <1Wi, and 

l(X] : M 1 )^)^/) +I(y/ : M l \R)X}) < l6S lC . 
Using Fact 12. 6| the second inequality implies 

l(Xj:R}\Yl)+l(Yl:R 1 j \Xj)<166 1 . 

The claim follows by adding the last two inequalities and using the chain rule for mutual information. 

□ 
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Conclusion and open problems 



We provide a strong direct product result for the two-way public-coin communication complexity 
in terms of an important and widely used lower bound method, the smooth rectangle bound. Some 
natural questions that arise are: 

1. Is the smooth rectangle bound a tight lower bound for the two-way public-coin communication 
complexity for all relations? If yes, this would imply a strong direct product result for the two- 
way public-coin communication complexity for all relations, settling a major open question 
in this area. To start with we can ask: Is the smooth rectangle bound a polynomially tight 
lower bound for the two-way public-coin communication complexity for all relations? 

2. Or on the other hand, can we exhibit a relation for which the smooth rectangle bound is 
(asymptotically) strictly smaller than its two-way public-coin communication complexity? 

3. Can we show similar direct product results in terms of possibly stronger lower bound methods 
like the partition bound and the information complexity? 

4. It will be interesting to obtain new optimal lower bounds for interesting functions and relations 
using the smooth rectangle bound, implying strong direct product results for them. 
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